
This document provides instructions to replicate “The Role of Career and Wage Incentives in Labor Productivity: 
Evidence from a Two-stage Field Experiment in Malawi” by Hyuncheol Bryant Kim , Seonghoon Kim , and Thomas T. Kim.

————————————————————————————————————————————————————————————————
Contents
1. Explanation of Data Needed for Replication (STATA data files)
2. Explanation of Stata Codes (dofiles)
3. Step-by-step Procedures to replicate the results 
4. Variable Definitions 
———————————————————————————————————————————————————————————————— 


1. Data (STATA dta files)
- All data files are in the STATA data format (.dta) and analysis codes are in the STATA dofile format (.do).
- A total of three STATA data files are necessary to replicate the findings of the paper.

1.1. enumerator_level.dta
- This dataset contains the information on 443 study participants in the 2014 baseline survey. The 2011 secondary school student survey data and other administrative data collected during the experiment are also included. 

1.2. household_level.dta
- This dataset contains the information on respondents of the 2015 population census. 

1.3. secondary_sch_survey.dta
- This dataset contains the information on the 536 eligible subjects who were male and recent high school graduates in the AFF’s project areas. This dataset was used only for Table A1 and A2.

2. A single STATA dofile, named codes_all.do, executes the procedures to replicate the findings of the paper

3. Step-by-step Procedures to replicate the results 
- Save all datasets and dofiles listed above.
- Set user’s own directories in the dofile accordingly.
- Run the dofiles.

4. Variable Definitions

4.1. Variables in ‘enumerator_level.dta’
id			Individual ID assigned from the 2011 secondary school survey
first_stage		First stage randomization (Internship vs Wage)
second_stage		Second stage randomization (G1, G2, G3, and G4)
quiz			Quiz score from the training session
practice_error		Practice survey error rate from the training session
team_visit1		Supervising team that first visited enumerators
train_accept4		1 if accepted the enumerator job offer
surveytype		Type of the practice survey from the training session
surveypair		Pair of the practice survey from the training session
gp			Practice survey group from the training session
age			Age
num_siblings		Number of siblings
asset_score2		Asset score
current_workstatus2	Currently working
rosenberg2		Self-esteem
intrinsic_value2	Intrinsic motivation
extrinsic_motivation2	Extrinsic motivation
extroversion		Extroversion
agreeableness		Agreeableness
conscientiousness	Conscientiousness
emotional_stability	Emotional stability
openness_to_experiences	Openness to experiences
avgfrac_distant_tpe	Time preference
avgfrac_risk	Risk 	preference
ccei_combined_tpe	Rational decision making ability
ability_index31		MSCE score
ability_index32		Raven and O*NET score
mcoffer2		Male circumcision treatment
eduoffer2		HIV/AIDS education treatment
cct			Scholarship treatment
fortoday		Transportation reimburse
bmi			The body mass index


4.2. Variables in ‘household_level.dta’
id			Individual ID assigned from the 2011 secondary school survey
first_stage		First stage randomization (Internship vs Wage)
second_stage		Second stage randomization (G1, G2, G3, and G4)
quiz			Quiz score from the training session
practice_error		Practice survey error rate from the training session
error_rate		Survey quality
num_dailysurvey		Survey quantity
datenum			Nth day of work from the beginning
attitude		Subjective peformance evaluation by supervisor
pes2q			1 if PES respondents are the previous census respondents
pes3q			Subjective peformance evaluation by survey respondents
hh_hsa_enu		Number of households per enumerator
distance1		Catchment area size
num_hhmember_new	Family size
asset_score_hsa_new	Household asset score
birth_hsa_new		Birth rate
death_hsa_new		Death rate
malaria_hsa_new		Malaria incidence under age 3
age			Age
asset_score2		Asset score
ability_index31		MSCE score
ability_index32		Raven and O*NET score
rosenberg2		Self-esteem
intrinsic_value2	Intrinsic motivation
extrinsic_motivation2	Extrinsic motivation
extroversion		Extroversion
agreeableness		Agreeableness
conscientiousness	Conscientiousness
emotional_stability	Emotional stability
openness_to_experiences	Openness to experiences
avgfrac_distant_tpe	Time preference
avgfrac_risk	Risk 	preference
ccei_combined_tpe	Rational decision making ability
after1_team1		1 if the survey was conducted after the first visit of supervision team 1
after2_team1		1 if the survey was conducted after the second visit of supervision team 1
after1_team2		1 if the survey was conducted after the first visit of supervision team 2
after2_team2		1 if the survey was conducted after the second visit of supervision team 2
after1_team3		1 if the survey was conducted after the first visit of supervision team 3
after2_team3		1 if the survey was conducted after the second visit of supervision team 3
after1_team4		1 if the survey was conducted after the first visit of supervision team 4
after2_team4		1 if the survey was conducted after the second visit of supervision team 4
after1_team5		1 if the survey was conducted after the first visit of supervision team 5
after2_team5		1 if the survey was conducted after the second visit of supervision team 5
cct			Scholarship treatment
mcoffer2		Male circumcision treatment
eduoffer2		HIV/AIDS education treatment
num_siblings		Number of siblings
hsa			HSA identification number
enumerator_pes		PES enumerator
first_superv1		1 if the enumerator experienced the first visit with supervision team 1
first_superv2		1 if the enumerator experienced the first visit with supervision team 2
first_superv3		1 if the enumerator experienced the first visit with supervision team 3
first_superv4		1 if the enumerator experienced the first visit with supervision team 4
first_superv5		1 if the enumerator experienced the first visit with supervision team 5
second_superv1		1 if the enumerator experienced the second visit with supervision team 1
second_superv2		1 if the enumerator experienced the second visit with supervision team 2
second_superv3		1 if the enumerator experienced the second visit with supervision team 3
second_superv4		1 if the enumerator experienced the second visit with supervision team 4
second_superv5		1 if the enumerator experienced the second visit with supervision team 5
wrong_rate		Proportion of entries incorrectly entered
blankall_rate		Proportion of entries incorrectly blank
surveytime2		Survey time per household (in mins)
notsurveytime2		Intermission time between surveys (in mins)
dailyworktime2		Work hours (in mins)
total_3			Training evaluation score
day’n’			Survey day ’n’

4.3. Variables in ‘secondary_sch_survey.dta’
id			Individual ID assigned from the 2011 secondary school survey
grp			Originally assigned first stage group for the target invitee
confirm			1 if subjects participated in the 2014 baseline survey
height_b		Height in 2011
weight_b		Weight in 2011
q103age_b		Age in 2011
living_with_father	Living with a father in 2011
living_with_mother	Living with a mother in 2011
asset_score		Asset score in 2011
self_health_good	Subjective health (good or very good) in 2011
raven11_score		Raven’s matrices test score in 2011
